ELZAR: Triple Modular Redundancy using Intel AVX

نویسندگان

  • Dmitrii Kuvaiskii
  • Oleskii Oleksenko
  • Pramod Bhatotia
  • Pascal Felber
  • Christof Fetzer
چکیده

Instruction-Level Redundancy (ILR) is a well known approach to tolerate transient CPU faults. It replicates instructions in a program and inserts periodic checks to detect and correct CPU faults using majority voting, which essentially requires three copies of each instruction and leads to high performance overheads. As SIMD technology can operate simultaneously on several copies of the data, it appears to be a good candidate for decreasing these overheads. To verify this hypothesis, we propose ELZAR, a compiler framework that transforms unmodified multithreaded applications to support triple modular redundancy using Intel AVX extensions for vectorization. Our experience with several benchmark suites and real-world case-studies yields mixed results: while SIMD may be beneficial for some workloads, e.g., CPU-intensive ones with many floating-point operations, it exposes higher overhead than ILR in many applications we tested.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Explorer ELZAR : Triple Modular Redundancy Using Intel AVX ( Practical Experience

Instruction-Level Redundancy (ILR) is a well known approach to tolerate transient CPU faults. It replicates instructions in a program and inserts periodic checks to detect and correct CPU faults using majority voting, which essentially requires three copies of each instruction and leads to high performance overheads. As SIMD technology can operate simultaneously on several copies of the data, i...

متن کامل

Elzar: Triple Modular Redundancy using Intel Advanced Vector Extensions (technical report)

Instruction-Level Redundancy (ILR) is a well known approach to tolerate transient CPU faults. It replicates instructions in a program and inserts periodic checks to detect and correct CPU faults using majority voting, which essentially requires three copies of each instruction and leads to high performance overheads. As SIMD technology can operate simultaneously on several copies of the data, i...

متن کامل

Improving the Fault Tolerance of a Computer System with Space-Time Triple Modular Redundancy

Triple Modular Redundancy is widely used in dependable systems design to ensure high reliability against soft errors. Conventional TMR is effective in protecting sequential circuits but can’t mask soft errors in combinational circuits. A new redundancy technique called the Space-Time Triple Modular Redundancy is presented in this paper, which improves the soft error tolerance of the combination...

متن کامل

Fault Tolerant Air Bubble Sensor using Triple Modular Redundancy Method

Detection of air bubbles in the blood is important for various medical treatments that use Extracorporeal Blood Circuits (ECBC), such as hemodialysis, hemofiltration and cardio-pulmonary bypass. Therefore a reliable air bubble detector is needed. This study presents the design of a new fault tolerant air bubble detector. Triple Modular Redundancy (TMR) method is used on the sensor section. A vo...

متن کامل

Reliability Prediction for Input-Output Cards of Distributed Control Systems in Power Plants

Distributed control systems in power plants are the latest trends to optimize and control performance. Reliability prediction of input-output components of distributed control systems have a key role to play in the power plant scenario. Triple modular redundancy is a means of increasing reliability of systems. Triple modular redundancy of three modules each containing m cards is studied in this...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016